#56 - Binding affinity prediction using a nonparametric regression model based on physicochemical and structural descriptors of the nano-environment for protein-ligand interactions

Authors

  • Luiz Borro
  • Inacio Yano
  • Ivan Mazoni
  • Goran Neshich
Abstract

We propose a new empirical scoring function for binding affinity prediction, built on physicochemical and structural descriptors that characterize the nano-environment encompassing both the ligand and the binding pocket residues. Our hypothesis is that characterizing the nano-environment of protein-ligand complexes as precisely as possible can improve binding affinity prediction. A similar hypothesis has already been shown to hold for the nano-environments of protein-protein interfaces and catalytic site residues (yet to be published).

INTRODUCTION

In structure-based virtual screening campaigns, in silico protein-ligand complexes are evaluated and ranked according to their estimated binding affinities. The ranking step is normally performed with scoring functions, i.e. mathematical models that assess the strength of the interaction between two binding partners. However, scoring functions are generally weak predictors of binding affinity, mostly because they fail to properly model the polar aspects of the protein-ligand interaction. To improve binding affinity prediction, we propose an empirical nonparametric predictive model derived from physicochemical and structural descriptors that characterize the nano-environment encompassing both ligand atoms and binding pocket residues.

METHODS

Datasets. To ensure an unbiased performance comparison with related approaches, we used the PDBbind v2007 refined set, which comprises 1300 diverse protein-ligand complexes with high-quality structural and binding data. The refined set was split into two disjoint sets: a training set of 1105 complexes, used to fit the predictive models, and a test set of 195 complexes (known as the core set), used for performance evaluation.

Protein-Ligand complex characterization.
A given protein-ligand complex is represented by physicochemical and structural parameters from the nano-environment covering the ligand atoms and binding pocket residues. To obtain a more detailed characterization, special attention was given to descriptors related to the hydrophobic effect and to the polar aspects of protein-ligand binding. Descriptors were divided into three classes: Ligand-Only (7 descriptors), Protein-Only (6 descriptors) and Protein-Ligand (9 descriptors), as shown in Table 1. Protein-Only and Protein-Ligand descriptors were calculated with the STING platform, whereas the Ligand-Only descriptors were calculated with Biovia Pipeline Pilot.

Table 1. List of descriptors used to characterize protein-ligand complexes, by class.

  • Ligand-Only: Volume, Polar Solvent-Accessible Surface Area, Strain Energy, Number of Hydrogen Bond (HB) Donors, Number of HB Acceptors, ALogP, Number of Rotatable Bonds
  • Protein-Only: Hydrophobicity, Electrostatic Potential @ Surface, Unused Contacts Energy (HB, Charged, Hydrophobic, Aromatic)
  • Protein-Ligand: Protein-Ligand Interaction (HB, Charged, Hydrophobic, Aromatic), Ligand Buried Surface, Energy Density, Sponge, Density, Protein Hydrophobicity Variation

Binding affinity prediction model. Using the descriptors listed in Table 1 and the experimental pKi values of the training set complexes as input data, the binding affinity predictive model (herein called STING) was trained as a random forest regressor.

RESULTS & CONCLUSIONS

STING’s performance was evaluated on the PDBbind benchmark v2007. Table 2 presents a performance comparison between STING and the top four scoring functions previously tested on the same benchmark. Our predictive model ranks among the best with regard to binding affinity correlation, with a Pearson correlation coefficient (RP) slightly lower than that of RF-Score::Elem-v2.
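The training and evaluation setup described above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the descriptor matrix and affinities are synthetic stand-ins, the positional 1105/195 split is only a placeholder for the predefined PDBbind core set, and the random forest hyperparameters are assumptions, since the abstract does not report them. It uses scikit-learn's `RandomForestRegressor` and computes RP with NumPy.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Synthetic stand-in for the real data: 1300 complexes x 22 descriptors
# (7 Ligand-Only + 6 Protein-Only + 9 Protein-Ligand). Values are made up.
rng = np.random.default_rng(0)
n_complexes, n_descriptors = 1300, 22
X = rng.normal(size=(n_complexes, n_descriptors))

# Hypothetical affinity: a nonlinear function of a few descriptors plus noise.
y = (6.0 + 1.5 * np.tanh(X[:, 0]) + 0.8 * X[:, 1] * X[:, 2]
     + rng.normal(scale=0.5, size=n_complexes))

# Split sizes from the text: 1105 training complexes, 195 test complexes.
# The real core set is a predefined subset, not a positional slice.
X_train, y_train = X[:1105], y[:1105]
X_test, y_test = X[1105:], y[1105:]

# Hyperparameters are assumptions; the abstract does not state them.
model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)


def pearson_r(a, b):
    """Pearson correlation coefficient between two 1-D arrays."""
    return float(np.corrcoef(np.asarray(a, dtype=float),
                             np.asarray(b, dtype=float))[0, 1])


rp = pearson_r(y_test, y_pred)
print(f"Test-set Pearson correlation (RP): {rp:.3f}")
```

With real data, `X` would hold the 22 descriptors of Table 1 and `y` the experimental affinities; only the held-out test complexes should be used when reporting RP.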
By statistically analyzing the contribution of each descriptor to the predictive model, we observed that the most important descriptors are related to shape complementarity (Ligand Buried Surface Area), the hydrophobic effect (Hydrophobicity, ALogP) and polarity (Polar Solvent-Accessible Surface Area, Electrostatic Potential @ Surface). This result suggests that STING may be further improved by expanding the characterization of protein-ligand complexes with additional descriptors of hydrophobic and polar complementarity.

Finally, considering STING’s performance on the PDBbind benchmark v2007, the de facto standard for validating scoring functions, we believe that our binding affinity predictive model can be a viable option for rescoring in virtual screening campaigns.
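The abstract does not say which statistic was used to rank descriptor contributions. One common, model-agnostic choice is permutation importance: shuffle one descriptor at a time and measure how much the prediction error grows. The sketch below illustrates that idea only; the `toy_predict` function, the descriptor names and the data are hypothetical stand-ins, not the STING model or its results.

```python
import random


def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def permutation_importance(predict, X, y, n_repeats=10, seed=0):
    """Average increase in MSE when each column of X is shuffled in turn."""
    rng = random.Random(seed)
    baseline = mse(y, [predict(row) for row in X])
    importances = []
    for j in range(len(X[0])):
        increases = []
        for _ in range(n_repeats):
            column = [row[j] for row in X]
            rng.shuffle(column)
            # Rebuild the rows with only column j replaced by shuffled values.
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            permuted = mse(y, [predict(row) for row in X_perm])
            increases.append(permuted - baseline)
        importances.append(sum(increases) / n_repeats)
    return importances


def toy_predict(row):
    """Hypothetical stand-in model; it ignores the third descriptor."""
    buried_surface, alogp, _unused = row
    return 5.0 + 2.0 * buried_surface + 0.5 * alogp


# Made-up data for three illustrative descriptors.
data_rng = random.Random(1)
X = [[data_rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(200)]
y = [toy_predict(row) for row in X]

names = ["ligand_buried_surface", "alogp", "uninformative_descriptor"]
scores = permutation_importance(toy_predict, X, y)
ranking = sorted(zip(names, scores), key=lambda pair: pair[1], reverse=True)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

A descriptor the model never uses gets an importance of zero, while descriptors with larger effects on the prediction rank higher. For a fitted random forest, the same procedure would be applied with the model's prediction function on held-out complexes.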

Publication date: 2016